Preferential compactness of networks 
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We introduce evolving networks where new vertices preferentially connect to the more central parts 
of a network. This makes such networks compact. Finite networks grown under the preferential 
compactness mechanism have complex architectures, but infinite ones tend towards the opposite, 
having rapidly decreasing distributions of connections. We present an analytical solution of the 
problem for tree-like networks. Our approach links a collective self-optimization mechanism of the 
emergence of complex network architectures to self-organization mechanisms. 
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I. INTRODUCTION 

Self-organization versus optimization — these two pos- 
sible explanations of specific network architectures have 
been extensively discussed in the last few years. The for- 
mer mechanism is more developed and is a standard ex- 
planation for fat-tailed degree distributions in networks 
in modern physics literature Q, |^ |^ . The optimization 
concept of the emergence of a complex network structure 
is more common for engineers and computer scientists 
0, 0,0,0- (For more general aspects of self-organization 
and optimization concepts see Refs. 0,0 and Ref. [lol| . 
respectively.) 

Usually these two explanations are seen as mutually 
contradicting However, the process of optimiza- 

tion of a complex network includes not an external will 
(a single designer) but numerous agents. Each of these 
agents (e.g., vertices) solves its "selfish", individual op- 
timization problem — optimizes a number of trade-offs — 
and tries to arrange its connections in the best (for this 
agent) way. Speaking in simple terms, one can even treat 
optimization of this kind in terms of self-organization and 
vice versa. 

To be more concrete, we consider a more narrow class 
of models. The simplest one, a model of a growing tree, 
where each new vertex becomes attached to one of ex- 
isting vertices selected in an optimal way, was proposed 
by Fabrikant, Koutsoupias, and Papadimitriou, Ref. 
(for the detailed study of the model, see Ref. see 
also Ref. 14] ). In the FKP model vertices have random 
geographical coordinates in a restricted area. A new ver- 
tex is attached to one of vertices chosen to minimize a 
linear combination of (i) the resulting shortest-network- 
path distance between the new vertex and the root of the 
tree and (ii) the Euclidean length of the new connection. 

The FKP model exploits the competition of two objec- 
tives: a desire to connect to the center of the network. 
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which produces compact networks, and a desire to have 
a physically short, cheap connection. The geographical 
coordinates of vertices are taken to be random, as well 
as, in fact, the Euclidean distances between vertices. So, 
one may roughly say that in this scheme, the combina- 
tion of a length to the center of a network and a random 
number is optimized. Consequently, the competition is 
actually between compactness and randomness. 

In this paper we study the interplay between these 
two tendencies to compactness and to randomness as the 
mechanism of generation of complex networks. We for- 
mulate the problem in terms of preferential attachment 
and so link the optimization models to self-organization 
ones. 

We introduce the following model of a growing network 
(tree) : 

(1) At each time step, a vertex is added to the network. 

(2) This vertex is attached to a vertex chosen with 
probability proportional to a function r{i) of its 
distance £ from the root. 

The preference function r{£) determines the structure of 
the network. As is natural, if r{£) = const, we arrive at 
the random attachment growth and at an exponential de- 
gree distribution. We show that if r{£) is rapidly decreas- 
ing, the resulting degree distribution of the finite network 
is of a complex form with several distinct parts. On the 
other hand, we find that in the infinite network limit, the 
degree distribution is a rapidly decreasing function. Note 
the difference of our approach from that of Ref. where 
preferential attachment of new vertices was determined 
by geographical closeness (Euclidean distance), see also 

Refs. Eillilil- 

Our results, typical resulting degree distributions in 
finite networks are shown in Figs, ^^a), |2Ia), and|3Ia). 
Note that the numbers of vertices in these examples far 
exceed typical sizes of networks in simulations of network 
optimization (see, e.g., Refs. 0^3) and in most of the 
studied real networks. In this sense, the model provides 
complex degree distributions even in "large" networks. 
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FIG. 1: (a) The degree distribution of the network growing 
under the mechanism of preferential compactness with the 
preference function r{(.) = x*. The value of the parameter 
X is 0.5, the size of the network is t = 8825. Equation @ 
gives r = 47.72 for these values. The width of the plateau 
is XT — 24, the height of the plateau 1/x = 2. (b) The dis- 
tribution Ad(£) in this network (the mean number of vertices 
at a distance £ from the root). Notice that the number of 
the nearest neighbors of the root, AI{£ = 1) = r, here is a 
vanishingly small fraction of the total number of vertices. 



In the present paper we explain the nature of this be- 
havior of the degree distribution in networks of this kind. 
We mean the combination of a complex form in the fi- 
nite networks, and a "trivial" one in the thermodynamic 
limit. Furthermore, we show that one can essentially ex- 
tend the range of the observation of the complex degree 
distributions by passing to more realistic models. 

Note that a tree structure was assumed only to simplify 
analytical calculations (it is easy to find intervertex dis- 
tances in trees). The basic idea of the model, that is, the 
specific preferential linking that makes a network more 
compact, may be also applied to networks with loops. 



II. EVOLUTION EQUATIONS 

The tree structure of the network simplifies the prob- 
lem. It is convenient to treat edges as directed, outgoing 
from newer vertices. So, we consider the statistics of in- 
degree k of vertices, and the quantity of interest is the 
average number N{k,£,t) of vertices of in-degree k at a 
distance £ from the root at time t. ("Time" is the cur- 
rent number vertices in the network.) Here the average 
means averaging over the entire ensemble of networks at 
time t. This is a set of networks which can emerge at time 
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FIG. 2: (a) The same as in Fig. [T] but x = 0.1, t = 9005. 
Equation ||5J gives for these values r — 292. The width of the 
plateau is xt = 29, the height of the plateau is 1/x = 10. (b) 
The distribution M(£) in this network. Note that M(£ = 1) 
is noticeable with these values of x and t. 
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FIG. 3: (a) The same as in Figs [H and El but s = 0.01, 
t = 9821. Equation © gives for these values r = 1280. The 
width of the plateau is xt = 13, the height of the plateau 
is_l/2; = 100. (b) The distribution M(^) in this network. 
M{£ — 1) is noticeable with these values of x and t, which 
results in the pronounced plateau in the degree distribution. 
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i as a result of the evolution of the model with statistical 
weights determined by the rules of this evolution. 

We use an approach, standard in networks with pref- 
erential hnking [20, 123 ■ Let M{i, t) be the average num- 
ber of vertices at a distance i from the root at time 
t. The mean in-degree of the root vertex is equal to 
the mean number of the nearest neighbors of the root, 
k{t = 0,t) = M{i = l,t). M{i,t) = J2k=QNik,i,t). 
Note that in this tree, the out-degree of any non-root 
vertex is 1, so the total degree of a vertex is A; -|- 1. 

One can see that the evolution of M{i, t) is described 
by the equation: 



M(£,i+ 1) = M{t,t) 



r(^- 1) 



Mie-l,t), (1) 



where 



N{k,£,t+ 1) - N{k,£,t) in Eq. (g)) can be substituted 
by the corresponding time derivatives. 

The direct way to solve the problem is as follows: (i) 
find the solution of Eq. in terms of yet unknown func- 
tion i.e., M{i,t,'E{t))- (ii) substitute this solution 
into Eq. J^Jl [or Eq. 121] and obtain S(i); (iii) with this 
T.{t), obtain the solution 14(1,1), (iv) solve Eq. Q with 
the known and 'M{£,t). 

Technically, it is convenient to use the mean number 
of the nearest neighbors of the root, M{£ = l,t) = T{t), 
instead of One can see that r(t) and S(t) are related 
to each other in a simple way. Without restriction of 
generality, we set r(0) = 1, so Eq. readily gives the 
relation: 



dM(l ^l,t) _ dT{t) _ 1 



dt 



dt 



(5) 



(2) Then, ioYMi,{T{t)) = M(l,t) and N k,t{T{t)) = N{k,£,t), 
we have the equations: 



An initial condition is M{£,t 1) Si^o, where Si_o is 
the Kronecker symbol. That is, the growth starts from 
a single vertex — the root, and t is exactly the number 
of vertices in the net. As is natural, Eqs. (Q) and ^ 
guarantee that 'M{£ = 0, i > 1) = 1. The solution M{£, t) 
must satisfy the condition 



Y,M{£,t)^t. 



(3) 



It is important that Eq. ^ does not contain N{k,i,t). 
The evolution of N{k,£,t) is governed by the equation: 



dr 



and 



dNkAr) 
dr 



r(^-l)M,_i(r) 



(6) 



r{£)[Nk^iAT)~NkAr)] 
+r(f-l)M,_i(r)<5fc,o- 



(7) 



The solution of Eq. ® is 



N{k,£,t+1) = N{k,£,t) 
+ ^^[N{k-l,£,t)^N{k,£,t)] 
r(£ - 1) 

+ ^^^M{£-l,t)Sk,o. (4) 

The second term on the right-hand side of this equation 
accounts for the attachment of new vertices to vertices 
of an ^th shell. The third term is due to emergence of 
extra leaves in the ith shell when new vertices become 
attached to vertices at distance £ — I from the root. 



III. CALCULATIONS 

We must first find a solution M{£,t) of Eq. (P), substi- 
tute this solution to Eq. and then obtain the result 
N{k,£,t). Note that the solutions are non-stationary. 

We consider a large t asymptotic behavior, so that 
the differences 'M{£, t + I) - 'M(£, t) in Eq. (HJ and 



Mdr)^^Yiri^). (8) 

Let us now, for simplicity, use a concrete form of the 
preference function, r{£) = x^, where 0<2:<1. [a; = l 
corresponds to random (indifferent) attachment; x > 1 
with large enough x results in chain-like structures.] In 
this case, the solution (|SJ) is "Poisson"-like: 

M,(r) = ix^(^-l)/V^ (9) 
with T satisfying the equation: 

The final formulas will express the dynamics of the model 
in terms of r, and the time dependence T{t) is the result 
of the solution of the transcendental equation (|10|) . 
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The asymptotics of the resulting dependence of the 
number of the nearest neighbors of the root, 



-it) - e 



2x 



ln(l/x) 



Iniexp ^21n(l/a;) Int , (11) 



can be obtained in situations where the peak of is 
well separated from £ — 0, so the saddle point method is 
applicable. Actually, for the validity of the asymptotic 
form l|ll|l. the point of the maximum of Mg, ^max ~ ^ 
must essentially exceed the width S£ of this distribution. 
(J is the average separation of a vertex from the root.) In 
its turn, the dispersion Si must be essentially larger than 
1, which is not the case for physically reasonable values 
of parameters. 

Expression Hll|) can be rewritten in the following form 



/^^y/2huTM/^/hrt 



(12) 



That is, T grows slower than any power of t but at finite 
t, looks rather as a "power law" with a slowly varying 
exponent (a "multifractal" appearance). 

The characteristics -^max (a typical distance from the 
root) and S£ of the distribution have asymptotics: 



In^ 



and 



2 ln(l/2;) + 



ln(l/x) 



1 1 



-1/2 



(13) 



(14) 



So that, very roughly, ^max ~ y/2\nt/ ln(l/a::). 

Unfortunately, these compact asymptotic formulas be- 
come precise only in astronomically large networks. For 
"physically reasonable" values of parameters (see Figs ^ 
13 and 131, it is simpler to numerically obtain T{t) from 
Eq. 11U|) than to use cumbersome next order asymptotic 
formulas. 

The solution of Eq. with the substituted A/^(t), 
formula @, is 



(-1) 



i+i 



rj.l{l+l)/2 



e-1 



.i=0 



k 
3=0 



l-l-i 



k + l-l-i\ {x'^tY 



1 



(15) 



Expressions © and H15|l with T{t) obtained from 
Eq. H10(l give the complete solution of the problem. It 
is useful to write out particular cases of the cumbersome 
expression (|15|) : for the nearest neighbors of the root, we 
have 



1 °° 



2^0 

{xtY 

9,1 



(16) 



i=k+l 

for the second- nearest neighbors, we have 



NkAr) = ^[k + l-x^T 



{x'^tY 



i=0 



(17) 



and so on. 



IV. RESULTS AND DISCUSSION 

Typical N{k, t) = N{k, £, t), obtained by the direct 
summation of expression (|15|l , are shown in Figs ^a) , 
|2Ia), and |3Ja). [A degree distribution is P{k,t) ~ 
N{k, t)/t.] The corresponding mean numbers of vertices 
at a distance £ from the root are shown in Figs^b),|2Kb), 
andUb). 

The degree distributions consist of three parts: there is 
a plateau between two regions of rapid decrease at small 
and large degrees. Note that the main fraction of vertices 
are leaves (vertices of degree 1). 

What is the nature of this complex form of the de- 
gree distribution, i.e., the nature of the plateau? The in- 
spection of expression ifT^ shows that N{k,£,t) rapidly 
decreases with k at each £ > 1. [Check, e.g., in the par- 
ticular case £ — 2, relation H17(l .] On the other hand, 
expression H16|l {Nk,i=i) shows that the plateau is de- 
termined by the contribution of the nearest neighbors of 
the root. Recall that the total number of these vertices 
is T{t). One can see from expression that Nk,t=i is 
close to 1/a; as fc < xt. Above this value of in-degree, 
N k.i=i is a rapidly decreasing dependence. Note that 
Efc^fc,fci(0 = Mfci(t) = T{t). Figs.[Tl;a),El;a), and 
ini^a) demonstrate that indeed the width of the plateau is 
XT, and its height is l/x. 

So, for the presence of the plateau in the degree distri- 
bution two conditions must be simultaneously fulfilled: 
(i) XT must be much greater than the mean in-degree of 
the network, i.e., much greater than 1; (ii) \/x must be 
large enough. Figs.[2ta) and^fa) show situations in fi- 
nite networks where these conditions are satisfied. One 
can check that if the size of the network tends to infinity, 
the relative number of the nearest neighbors of the root 
approaches zero, T/t 0, and the plateau disappears. 

Thus, it is the nearest neighbors of the root that make 
the degree distribution of the network complex. The de- 
crease of the fraction of these vertices in large networks 
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FIG. 4: The mean separation of a vertex from the root as a 
function of the degree of this vertex, x = 0.1, t — 20142, and 
so r = 400. The dependence exponentially approaches 1 as A; 



mcreases. 



makes the plateau unobservable and so trivializes the 
form of the degree distribution. We believe that this is a 
rather generic feature of networks of this kind. We mean 
networks, generated by optimization related algorithms, 
where a function of an intervertex distance is used in 
optimization (or for selection a vertex for attachment). 
Quite similarly to what we observed, in the FKP model, 
the "complex" , power-law part of the degree distribution 
disappears in the thermodynamic limit and vertices are 
almost surely leaves [T^ . 

One might consider this trivialization in the thermody- 
namic limit as a defect of these constructions. However, 
most of real networks are not so large. Empirical stud- 
ies and simulations are mostly made for networks smaller 
than the networks in Figs^BI Moreover, the typical sizes 
of networks in simulations of optimization mechanisms 
are 200, 100, and 50 vertices (see, e.g., Refs. 0, IToj). 
In this sense, we observe complex architectures in quite 
large networks. 

The diminishing of the fraction of the nearest neigh- 
bors of the root, r(t)/i, in the network, which we consider 
in this paper, is a direct consequence of the narrow distri- 
bution of vertices over the distance £ from the root [see 
Figs n^b) , [2Ib) , and|3Ib)]. As the network grows, the 
mean distance of a vertex from the root, i increases as 
Vint (which is meanwhile slower than a standard loga- 
rithmic dependence typical for networks with the small- 
world effect). Simultaneously, the relative width of the 
distribution decreases [see relation H13|) ]. and approaches 
zero in the large network limit, it is important that this 
behavior of the relative width is practically generic for 
networks with the small- world effect. In other words, in 
infinitely large networks with the small- world effect, all 
vertices are almost surely mutually equidistant. So, in 
the large networks, even rapidly, exponentially decreas- 
ing preference function r{£) oc leads to attachments 
mostly to vertices at the distance J from the root. Con- 
sequently, in the large networks, the nearest neighbors of 
the root have few chances to attract new connections 

In another form, the result of this evolution is shown in 
Fig. 21 the average separation l{k) of a vertex from the 



root as a function of the degree of this vertex in a typical 
situation. This figure demonstrates that most vertices 
have the same separation from the root. Note that the 
complementary characteristic, the mean in-degree of a 
vertex at distance i from the root shows a quite different 
behavior: 



ki£,t) 



M{£,t) ~ £+1 



(18) 



[we have used relation @]. 
A number of modifications of the model are possible: 

(1) An analytical solution has been obtained for the 
tree network, but in principle, one may also allow con- 
nections between already existing vertices using the same 
preference function. 

(2) One may introduce the variation of the preference 
function with the network growth, e.g., use r{£,t) = 
x^{t). This allows us to increase the range of network 
sizes, where the degree distribution has a complex form, 
but the model becomes ugly. 

(3) One may introduce an additional finite probability 
that a new vertex is attached to the root of the network. 
This leads to a finite fraction of the nearest neighbors 
of the root [T{t) oc t, the condensation of edges on the 
root] , and so to the presence of the plateau in the degree 
distribution even in the infinite network. 

(4) If, nevertheless, we want to avoid the condensation 
of edges, we may, for example, modify the rules of the 
model in the following way. We introduce equiprobable 
attachment of a new vertex to the root or to any of its 
nearest neighbors. In other words, the "center" of the 
network consists now of the root and its nearest neigh- 
bors. 

Let us discuss the last possibility in more detail. The 
evolution of the average degree of the root, T{t), is de- 
scribed by the equation: 



dr 



1 



(19) 



where p is the probability that a new vertex becomes 
attached to the "center" of the network. The initial con- 
dition is T{t = 0) = 0, so the solution of the equation is 

It is convenient to use r as a new "time" variable, 
instead of t. In this case, the number of vertices in the 
"center" of the network per "time" step increases by 1. 
On the other hand, the number of vertices, which become 
attached to the "center" vertices per time step, increases 
with T as dt/dr = t /p. So, the equation for the average 
number of the vertices of in-degree k in the "center" of 
the network is of the form: 



N{k,T + l) = N{k,T)+5kfl 

"iv(fc-i,T) N{k,Ty 



T 
P 



(20) 
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The solution of this equation, N{k,T), has a kink-hke 
form: N{k < t/p,t) = p, 7V'(fc > t/p,t) = 0. So, the 
width of the plateau is pr. Recall that in the original 
model, we had the plateau width xt with an extremely 
slowly growing T{t). In contrast, in the present situation, 
the number of vertices in the "center" of the network, 
T{t), and the plateau width rapidly grow as Vi. This 
allows one to observe highly connected vertices in larger 
networks than in the original net. Note, however, that in 
the present case, in average, there is no more than one 
vertex of each degree in the plateau part of the distribu- 
tion. 



V. SUMMARY 

We have proposed a class of networks exhibiting the 
mechanism of preferential compactness. New vertices 
prefer to be attached to the central part of a net, which 
favours compact structures. 

Our approach links optimization concepts of complex 
networks to the self-organization mechanism. These net- 
works, while growing, self-organize into compact struc- 
tures, whose linear sizes are determined by the assumed 
preference functions. These thus fix a balance between 
tendencies to compactness and to randomness in the net- 
work growth. 

As an example, we have studied a tree-like growing 
compact network (the mean intervertex distance grows 
slower than the logarithm of the number of vertices). We 
have found that the degree distribution of the network 
has a complex (non-power-law, in our particular case) 



form only in the finite networks. The degree distribution 
trivializes in the large network limit, where this distribu- 
tion consists only of a rapidly decreasing part. We have 
shown that this "trivialization" of the network structure 
is practically unavoidable if an intervertex distance is 
used in the optimization (or preferential linking) process. 
However, in our networks, the infinite network limit is 
approached slowly. So, the structures of even reasonably 
large (but still finite) networks, generated in this way, 
are complex. The relevant network sizes, where a com- 
plex distribution of connections is observable, turn out to 
be typical for many real and simulated networks. Com- 
parison with the FKP model shows that the observed 
behavior should be common for networks whose evolu- 
tion is directly determined by separation between their 
vertices. 

In summary, we presented a new effective approach 
to what one can call the "collective self-optimization" of 
networks. 
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